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1 Introduction 



1.1 Motivation 

The simplex algorithm is widely used to solve linear programs. It works well in practice, 
though often one cannot prove that it will. A parameter in the algorithm is the rule that 
selects one move among possible moves ("pivots") that decrease the objective function. De- 
terministic pivot rules are known to be possibly very far from optimal. For example, consider 
problems with the number m of constraints of the same order as the dimension, n. For vir- 
tually every deterministic pivot rule there is a problem for which the algorithm will take 
exponential time, although it is conjectured that there exists a descent path whose length 
O(n). Variants of the original argument by Klee and Minty (KM72j are cited in |GH Z98, 
page 2]. 

Randomized pivot rules appear to do better. According to |GHZ98j . several of the 
most popular randomized pivot rules appear to have polynomial - even quadratic - running 
time. Rigorous and general results on these, however, have been hard to come by. When 
one restricts to a narrow class of test problems, it becomes possible to obtain some rigorous 
results. Gartner, Hank and Ziegler GHZ98 consider three randomized pivot rules. Relevant 
to the present paper are their results on the random edge rule, in which the next move is 
chosen uniformly among edges leading to decrease the objective function. They analyze 
the performance of this rule on a class of linear programs, the feasible polyhedra for which 
are called Klee-Minty cubes, after KM72 . Such cubes are good benchmarks because they 
cause some pivot rules to pass through a positive fraction of the vertices. They prove that 
the expected run time is quadratic, up to a possible log factor in the lower bound: 

Theorem 1 (GHZ) The expected number, E n of steps taken by the random edge rule, 
started at a random vertex of a Klee-Minty cube, is bounded by 



Their lower bound rules out the possibility that E n ~ npolylog(n) which was twice conjec- 
tured by previous researchers [P^82l page 29], |Kel81| . They guess that the upper bound is 




Here, H n = Sj=i J ~ logn is the n th harmonic number. 
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the correct order of magnitude, and state an improvement in the upper bound from (l/2)n 2 
to 0.27 . . . n 2 , whose proof is omitted. 

The method of analysis in GHZ98 is via a combinatorial model due to |AC78j which 
describes the progress of the algorithm as a random walk on an acyclic directed graph. In 
their model, vertices are bijectively mapped to sequences of 0's and l's of length n, and 
each move consists of flipping a 1 (chosen uniformly at random) to a 0, and simultaneously 
flipping all bits to the right of the chosen bit. It was in this form that the problem came 
to our attention. Indeed, the remainder of the paper is framed in terms of a variant of this 
model, which we find to be an intrinsically interesting model. Our main result, Theorem[2J 
closes the gap left open in GHZ98 , proving that the upper bound is sharp to within a 
constant factor and obtaining upper and lower bounds differing by a factor of less than 2. 
We have moved the model to continuous time and made n infinite, since from our view as 
probabilists this is the most natural way to frame such a model. Nevertheless, our results 
apply to the setting of IGHZ98j as well (Corollary |SJ). Although the model seems simple, 
we remark that we were unable to prove many things about the model, including whether 
certain limits exist. 

1.2 Statement of the model 

The one-dimensional integer lattice is decorated with 0's and l's, arbitrarily except that 
there must be some point to the left of which lie only 0's. Each site has a clock that goes 
off at times distributed as independent mean-one exponentials. When a clock rings, if there 
is a there nothing happens, but if there is 1 there, then it and all (infinitely many) of the 
sites to the right flip as well - 0's become l's and l's become 0's. Later arguments will use 
random variables involved in the construction of this continuous time Markov chain, so we 
give a formal construction as follows. 

Let S be the subset of {0, 1} Z consisting of sequences of 0's and l's that have a leftmost 
1 (equivalently, have finitely many l's to the left of the origin). Let {N(j,t) : t > 0}j£z 
be a collection of IID Poisson counting processes, that is, step functions increasing by 
1 at random times, which we denote having independent increments distributed as 
exponentials of mean 1. As usual, the filtration is defined by letting Tt denote the cr-field 
generated by {N(j, s) : s < t}. For any oj 6 S, we define a Markov chain starting from oj as 
follows. If <jj is the zero string, the chain remains at u. Otherwise, let i be the position of 
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the leftmost 1 in uj. First, fix j > i € Z and define a Markov chain on configurations on the 
index set {i, i + 1, . . . , j} in the finitary model: each site s attempts to flip at times £ Sjr for 
all r > 0. The flip is successful if and only if there is a 1 in position s at time £~ r , in which 
case sites s + 1, . . . j flip as well. This is a well defined process because for each T there are 
only finitely many jump times £ Sjr with i < s < j and r <T. It is clear that for j' > j the 
Markov chain thus defined on configurations on projects down to the Markov chain 

on configurations on by ignoring the bits to the right of j. By Kolmogorov's Extension 
Theorem there is an inverse limit as j — ► oo. This is a Markov chain, which we denote {ojt}, 
taking values in trajectories u : [0, oo) — > S, and having jumps at a countable set of times 
£ s ,r-) at which a 1 flips to a and all bits to its right flip as well. 

We are interested in the speed at which the leftmost 1 drifts to the right. Because this 
chain converges weakly to the zero state, it is convenient to renormalize by shifting to the 
leftmost 1. Consequently, we define the space S to be the space of those sequences of 0's 
and l's indexed by the nonnegative integers that begin with a 1. We define two functionals 
fizeros and fiones on S by letting fiones (x) > 1 be the number of leading l's: 

fiones(x) := inf{j > 1 : x(j) = 0} 

and letting Bzeros(x) > be the number of successive 0's after the first 1: 

fizeros(x) := —1 + inf{j > 1 : x(j) = 1} . 

Let Bones j denote the set {x : fiones (x) = j} which form a partition of E, and let {fizeros j} 
denote the analogous partition with respect to the values of fizeros. 

We now define a Markov chain on the space S whose law starting from x G E is denoted 
Q x . Pick uj G S such that the leftmost 1 of a; is in some position i and uj(i + j) = x(j) 
for all j > 0. We construct the Markov chain {X t } on E as a function of {M t } as follows. 
First, define do to be 0, and iq to be the position of the leading 1 in uj. Now recursively 
we let a n be the first time after a n -i for which N(i n -i, •) increases. For a n _i < t < a n we 
let X t be M t shifted so that i n -\ is at the origin (and ignoring negative indices). We let 
fijump n denote fiones((j^) and we let i n = i n ~i + fijump n . 

Sometimes, it is convenient to look at the chain sampled at times a n . Thus we let 

y n ■= x an 
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which is now a discrete-time Markov chain. We let P x denote the law of this chain starting 
from x. 



Since the conditional distribution of a n — o~ n -\ given T Cn _^ is exponential of mean 1, it 
follows that a n /n — > 1. The distance the leading 1 has moved to the right by time t is the 
sum Yln-a n <t ^j um Pn> an d therefore the average speed fispd(n) up to time a n is the random 
quantity 

n n 

fispd(n) := a' 1 ^ fijump^- ~ n" 1 ^ Bjump^ . (1) 
j=i j=i 

It is not a priori clear, nor in fact can we prove that Bspd(n) has a limit as n — > oo. 
Consequently we define 

Binf-spd := liminf„8spd(n) ; 
fisup-spd := limsup„fispd(n) . 



Problem 1: Show that the limiting speed exists. 



The Markov chain {1^} is a time-homogeneous process on a compact space, so taking cesaro 
averages of the marginals, we see these must have at least one weak limit. Any weak limit 
is a stationary distribution. 

Problem 2: Show that there is a unique stationary distribution for the chain 

{Yn}- 

This would imply a positive solution to Problem 1. In particular, it would imply that 



n „ 
1 fjjumpj — > / 8jump 1 dir 

.7=1 J 



n 

— 1 ' 

n 
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where tt is the unique stationary measure; hence the speed would not only exist but would 
be almost surely constant independent of the starting state. Although we do not have a 
solution to Problem 2, we believe something stronger may hold. 
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Problem 3a: Let T% denote the composition of the measure tt with a trans- 
lation by j bits, e.g., if A is the event that there is a 1 in position r, then 
T j tt(A) = tt(A j ), where Aj is the event that there is a 1 in position r + j. Prove 
that T^-ir — > M where tt is the stationary measure and M is IID fair coin flipping. 

Problem 3b: Prove or disprove that the unique stationary measure tt is equiv- 
alent (mutually absolutely continuous) to M. 

To see why we believe (3a) to be true, consider a window of positions of any size k, 
located far to the right of the initial 1. Between each time a clock rings in this window 
there are many times a 1 turns to a to the left of the window. Each time this happens, the 
bits in the window all flip. Projecting configuration space onto what is visible in the window, 
and again onto a space of half the size by identifying each configuration with its complement, 
it seems reasonable to approximate the projection by a Markov chain. Specifically, from 
a state {x,x c }, for each position i € [1, . . . , k], exactly one of x or x c will be able to flip 
the bit in position i, so one may imagine the pair {x,x c } as flipping at rate 1/2 in every 
position. This chain has the uniform distribution as the unique stationary distribution. 

2 Statement of main result and lemmas 

In this section we state the results that we do know how to prove, namely bounds on the 
lim inf and lim sup speeds. The following theorem is to be interpreted as referring to the 
lim inf and lim sup speeds (until we have proved the speed exists). 

Theorem 2 The speed of the drift of the leftmost 1 satisfies 

1.646 < fiinf-spd < fisup-spd < 2.92 . 

Relating back to the performance of the random edge rule on Klee-Minty cubes, we 
have: 

Corollary 3 For sufficiently large n, starting from a uniform random vertex of the Klee- 
Minty cube, 

0.086n 2 < E n < 0.152ra 2 . 
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Proof: Gartner, Henk and Ziegler consider another way of counting steps, where instead of 
choosing an edge at random among all those decreasing the objective function, they choose 
an edge at random from among all edges, but suppress the move if the edge increases the 
objective function. For a vector x of O's and l's of length n, let L*(x) = L*(x, n) denote the 
expectation of the number of moves starting from x r , including the suppressed moves, 
before the minimum is reached. They prove the following identity GHZ98, Lemma 4]. 

s "=ii>*( xr ' re ) ( 2 ) 

■r=l 

where x r is the vector of length r consisting of all l's. Including suppressed moves in the 
count corresponds in our infinite, continuous-time model to counting the number of clock 
events (only among the first r vertices). Let be the time it takes starting from x r to 
reach the minimum. By the strong law of large numbers, /T^ ~ r as r — > oo. Also by 
the strong law, if the liminf and limsup speed are known to be in the interval (a, b), then 

T< Tir) < r - 
b a 

for sufficiently large r. Hence, for sufficiently large r, 

am < N (r) < vl 

b a 
and plugging into (J5J) and summing from r = 1 to n gives 

n 2 ^ n 2 

-T7<E n < — 

46 4a 

for sufficiently large n. Plugging in a = 2.92 and b = 1.646 proves the corollary. □ 

The lower bound of Theorem|2is proved in Section|lJ The lower bound may in principle 
be improved so as to be arbitrarily near the actual speed. For the upper bound, we state 
some lemmas. Let 
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i^ 3 

denote the k th harmonic number. Let {S n } be a random walk whose increments are equal 
k with probability 2/((/c + l)(k + 2)) for each integer k > 1. Let S := Sg~i where G is an 
independent geometric random variable with mean 2. Let be a random variable satisfying 

°° 1 1 H 
P(e>j) = l-Fe(j-l) = ElI^ = -' 

^ — ' k k + 7 7 

k=l J J 
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Assume {S n } and are independent of each other and of {J~t}; denote expectation with 
respect to P x by E x and let E denote expectation with respect to the laws of S and 0. 
Analogously with fizeros(x) we define the quantity 

fizeros*(x) := —1 + inf{j > Bones(x) : x(6ones(x) — 1 + j) = 1} 

to be the number of zeros after the first block of ones; thus Bzeros*(x) > 1, 8zeros*(x) = 
fizeros(x) if and only if fizeros(x) > 1 and fizeros(x) = if and only if Bones(x) > 2. 

Lemma 4 For any x G ftonesj, 

i 1 

E^fijump 1 = - . 

k=l 

Equivalently, for any x£H, 

EJJjum Pl = H fiones{x) . 

Lemma 5 For any j > 1, any x 6 fizerosj, and any integer L > 1, 

P x (i?ones(yi) > L) < P(5 + j > L) . 

When flzeros(x) = then 

P x (i?ones(yi) > L) < P(G + fi zeros* (x) • B + S > L) 
where B is a Bernoulli with mean 1/2, and Q,B and S are all independent. 

Since S + + fizeros*(x) is an upper bound for both quantities S + fizeros*(x) and 
S + + l?Bzeros*(x) appearing as stochastic upper bounds in Lemma |SJ and since H n 
increases in n, we may put this together with Lemma 0] to obtain 

Corollary 6 For any x, 

E x fijump 2 < E# 0+jfeeros * (x)+5 . 

□ 
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Lemma 7 The conditional distribution of fizeros* (Y n ) given T Un _ x is always bounded above 
stochastically by the law of®. In other words, for all j, 

¥(fizeros*(Y n ) > j | T Cn _ x ) < 1 - F e (j) . 

Proof of Theorem [2] from the lemmas: It suffices to show that for any x£S, 

E x fijump 3 < 2.92 . 

We simply iterate conditional expectations and compute. By the Markov property, and 
Corollary |BJ 

E x fijump 3 = ExEy-jfijumpa 

< E x (E 2/ iI s+0+fizeros * (a) ) |^ =yi . 

Since H n is increasing in n we may use the stochastic upper bound in Lemma Q for any x 
to see that this is at most Ei/ s+e ( 2 ) where is the sum of two independent copies of 0. 
The upper bound in Theorem |2] is completed by computing an upper bound for this. 

The function H is concave and 0( 2 ) > 1, so 

H (6< 2 ) + j) - H{Q^) < H(j + 1) - 1 
and we may therefore conclude that 

E# e(2 ) +S < EHq(2) + EH s+ i - 1 . (3) 

To compute the quantity Ei? e ( 2 ) , let Gi and Q2 independently have the distribution of 
B and write 

EH ei+ e 2 = 1 + E ^T^ 01 ^ j ^ &2 = k ^ 

l<j,k J 



1 Hj ( H k H k+ i 



i<j,k j + k j V k k + l 
^ j + k j \ k k k k + l 

l<j,k 



l<j,k 



1 Hj Hk+\ — 1 
j + k j k(k + l) 
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We may evaluate this numerically as just under 2. 

For the last term on the RHS of ©, let <ft(z) = Kz s = E^°=o Z " IP (* S ' = n ) be the 
generating function for S, so Z(ft is the generating function for 5 + 1. For any positive 
integer j there is an identity 

/ dz= / {! + ■■■ + z j - 1 )dz = H j . 

Jo I-* Jo 

Consequently, we may write 

Eg s+1 -1 = f 1 "^'^ /'s^f^U. (4) 



Jo 1-z Jo 1-2 

To compute the generating function (ft, first compute the generating function / for the 
increments of {S n }: 

= f> 2 2z-z*-2(l-z)lo gj L- z 

^ (* + !)(* + 2) 2 2 

Then <^> = 1/(2 — /) and the integral in (jlj) becomes 

Z" 1 2z(log y^— — z) 

KH S+ 1= / — ; 1 ^ - ^ ^dz. 

Jo 3z 2 - 2z + 2(1 - z) log 

One may evaluate this numerically to approximately 0.918797. Adding this to the value for 
Effo(2) gives a little under 2.92. Rigorous bounds may be obtained with more care. □ 



3 Proofs of Lemmas 

Proof OF LemmaD By definition, each x G Bones-,- begins with j l's in positions 0, . . . , j— 1 
followed by a zero. The evolution of {-M*} decreases the binary representation Ylk 2 -fc x(/c), 
whence -^gj um p- £ fionesfc for some fc < j, that is, there is always a zero in some position 
in [0,j]. Furthermore, once there is a zero in position k for some k < j, then there is always 
a zero at or to the left of position k. It follows that fijumpi is equal to the least k < j 
for which ^- j < £o,l> that is, for which the clock in position k goes off before the clock at 
position 0. The minimum is taken to be j if there is no such k. 
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It follows that P x (Bjump 1 = j) = 1/j and that for < k < j, 

«™"= J ' = iFi)' < 5) 

To see this, note that 8jump 1 = k if and only if £0,1 is the minimum of the exchangeable 
variables {Ck,i ■ < k < j}. Similarly, for < k < j, Bjumpi = k if and only if ^,1 is 
the minimum of the variables {£, r ,i '■ < r < k} and £0,1 is the next least of the values. 
Computing expectations via © proves the lemma. □ 

Proof of Lemma [7| Again let us denote q = Bones(x). We recall that P(fijump 1 = k) = 
1/q for k = q and l/(k(k + 1)) for 1 < k < q — 1. We claim that for any k < q, 

P(fizeros*(yi) > j I 6jum Pl = k) < . (6) 

k + J 

If we can show this, then we will have 

P(e w(y,)>,)<Ji±i + g^|±i. 

Changing q to q + 1 increases this by 

(? + i)(<7 + i)(<7 + i + i) 

which is nonnegative. Setting q = 00 then yields the upper bound in the lemma, and it 
remains to show ©. 

Observe first that it suffices to show this for k = q. This is because when k < q, the event 
{8jump 1 = k} necessitates £k,i = m i n {£r,i : < r < k}. Thus to evaluate P(fizeros*(Yi) > 
j I fijumpx = k) we may wait until time 1, at which point if no bit to the left of k has 
flipped yet, the new conditional probability P(Bzeros*(Yi) > j \ T^ h 1 , fijumpx = k) is always 
at most (k + l)/(k + j) by applying the claim for q = k. 

Assuming now that k = q, we note that the event {Bjumpj^ = k} that we are conditioning 
on is just the event 

A- ■= {£0,1 = min 

0<i<fe-l 

is the event that the clock at goes off before any clock in positions 1, . . . , k— 1. Conditioning 
on A then makes the law of £0,1 an exponential of mean 1/k without affecting the joint 
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distribution of {£, r ,s ■ r > k}. Now let mi be the position at time £k,i of the first 1 to the 
right of k, and let t± be the time this 1 flips. Inductively, define m r +i to be the position of 
the first 1 to the right of k after time t r and let t T+ \ be the first time after t r that this 1 
flips. 

If the positions m r , . . . , m r + j — 1 are not filled with ones at time i r _i (define to = 0) 
then it is not possible to have Bzeros*(Y"i) > j and A and t r _i < £o,i < t r . That is, one 
cannot get from fewer than j ones in the first block of ones to the right of k to at least j 
ones at the time of the flip at without having the leftmost one in this block flip. On the 
other hand, if these j positions are filled with ones at time t r _i, then 

P(Bzeros*(Yi) > j,£ ,i < U \ A, T tr _ x , f ,i > *r-i) < 

k + j 

since the event {fizeros*(Yi) > j, £o,i < t r } requires that the clock at go off before the 
clocks in positions m r , . . . , m r +j — 1 (recall that conditioning on A has elevated the rate of 
the clock at to rate k). Similarly, P(£o,i < t r \ A,J 7 tr _ 1 ,!;o j i > i r -i) = k/(k+l). Therefore, 

P(fizeros*(yi) > j, g ,i < t r \ A, ^^,£0,1 > t r _i) fe + 1 
F(6,i < t r I A, T tr _ x , £0,1 > *r-i) ~ k + j' 

By the craps principle, the RHS is then an upper bound for the probability of fizeros*(Yi) > 
j conditioned only on Bjump! = k. □ 

Proof of Lemma |SJ Let x € Bzerosj and first assume j > 1. We prove the statement for 
n = 1, the proof for greater n being identical, conditioned on J- Un _ 1 . It is simple to check 
whether fiones(Yi) = j. The bits in positions 1, . . . , j will remain 0's until the leading 1 
flips at time a%, so the only thing to check is whether a\ = £0,1 is less than or greater than 
With probability 1/2, £0,1 < £7+1,1 an d in exactly this case Bones (Yi) = j. 

Now condition on this inequality going the other way: £01 > £7+1,1 • Let t\ := £^1. Let 
j + 1 + k\ be the position of the first of x to the right of position j + 1. Then at time t\, 
the position of the first one to the right of j + 1 is j + Z±, where Z\ is the least I £ [1, k\ — 1] 
for which £,+1+^1 < £7+1,1- If no such I exists, then Z\ = k\. We compute P x (^i = I) as 
follows. 

The variables {£ r ,i : r G {0}U[j+l, j+/ci]} are independent exponentials. For 1 < I < k%, 
the event that fiones(Yi) 7^ j and Z\ = I is the intersection of the event A that £/+i,i is less 
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than £0,1 and £r,i f° r all j + 2 < r < j + / with the event B that £ J+ i + ^i < In other 

words, among 1 + 2 independent exponentials, the index of the least and second least must be 
j + 2 and j + 1 respectively. The unconditional probability of this is l/((Z+l)(Z+2)). Having 
conditioned on the larger event < £o,i}> the conditional probability is therefore equal 

to 2/((7+l)(/+2)). This holds for I < k\, where Z\ = k\ with the complementary probability 
To sum up, Z\ is distributed as S\ A k\ where S\ has the distribution of the random walk 
increments described in the lemma. 

The last step is to invoke the Markov property. Condition on F tl . The chain from here 
evolves under the law Px(ii)- Iterating the previous argument, there are two cases. The 
first case, which happens with probability 1/2 is that the clock at the origin goes off before 
the next alarm at location j + 1 + Z\. In this case, Bones(Yi) = j + Z\. In the alternative 
case, we let t 2 be the time at which the clock at location j + 1 + Zi next goes off. We let 
Z 2 be the number of consecutive l's at time t 2 starting from position j + 1 + Z 1 . Then 
conditional on T tl , Z 2 is distributed as Si A k 2 where k 2 is the number of consecutive l's 
at time t\ starting at position j + 1 + Z\. 

Iterating in this way, we have the following inductive definitions. Let to = 0. Let r 
be the least r for which the clock at the origin goes off after time t r but before the first 
alarm at location j + 1 + Ya=i F° r each r < r, we may define k r to be the number 
of consecutive l's at time t r -\ starting at location j + 1 + Yll=i ^i- We may then define 
t r to be the first time after t r -\ that the alarm at location j + 1 + Yll=i %i g° es or b an d 
we may define Z r so that j + 1 + Yll=i is the location of the first zero to the right of 
3 + l + YHZ{Zi at time t~ . 

The upshot of all of this is that 

r 

fiones(Yi) =j + ^Zi 

i=l 

and that the joint distributions of r and {Zi : 1 < i < r} are easily described. Conditioned 
on r > r and on T tr , the probability of r = r + 1 is always 1/2; as well, Z r+ \ given 
r > r + 1 and Tt r is always distributed as a truncation of S\. We conclude that Bones (Yi) 
is stochastically dominated by the sum of r independent copies of Si, hence as Sq-i- 

Finally, we consider the case where fizeros*(x) = I > Bzeros(x) = 0. Let q = Bones(x), 
so that x begins with q ones, followed by I zeros, followed by a one in position q + I. A 
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preliminary observation is that if we begin with a one at the origin, the position W(t) of 
the leading one at a later time t is an increasing function of t; hence, if T M is an independent 
exponential with mean fi, the distribution of W{T^) is stochastically increasing in ji. 

Begin by writing 

P x (6ones(Yi) > j) = ^ P(fiones(Yi) > j,Bjum Pl = k) . 
k=l 

Let I* denote the number of zeros consecutively starting from position k at time i if 
fijump]^ = k < q, and I* = I if k = q. In other words, I* = fizeros*(x') where x' is the 
word at the last time t that fiones changes before the leading bit flips (t = Cfijump i ^ 
fijump]^ < q and t = otherwise). We may then describe fiones(Yi) as I* + W, where W is 
the number of consecutive positions starting from position 8jump 1 + I* that turn to zeros 
between time t and £0,1 • Now we break into two cases. 

Condition first on {8jump 1 = q}. The time £0,1 is now an exponential of mean 1/g, 
and before this time, the bits from position q + I onward evolve independently. We may 
describe fiones(Yi) as I + W(£o,i)) where W is the number of consecutive positions starting 
at q + I which have become zeros in the time from to £0,1 • The first part of this lemma 
established that when £ has rate 1, then W(£) ^ S. Our preliminary observation now 
shows, conditional on {Bjump-L = q}, that W(£o,i) ^ S). 

Next, let us condition on fijumpj^ = k < q, obeserving that then I* < q — k. In order to 
have 1* > r, it is necessary that = min{£ Si i : k<s<k + r — 1}. Having conditioned 
on fijump]^ = k, the distribution of £fc,i becomes an exponential of rate k + 1, so that the 
conditional probability of this clock going off before r — 1 other conditionally independent 
clocks of rate 1 is just (k + l)/(fc + r). Since the event {6jump 1 = k < q} has probability 
l/(k(k + 1)) if k < q and zero otherwise, we may remove the conditioning and sum to get 

P(f > r, fijum Pl < q) < j£ j^y^ < P(f * > r) . 

We also still have in this case W r< S. 

Putting together the cases 8jump 1 = q and fijumpj^ < q, we see that /* = / with proba- 
bility l/q and otherwise I* ^ 0. The crude bound l/q < 1/2 gives I* ^ + fizeros*(x) • B. 
Since I* G cr(JT t ) and the bound W ^ 5 holds conditionally on Tt, we arrive at Bones(Yi) < 
e + fizeros*(x) ■ B + S. □ 
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4 Lower bound 



To arrive at a lower bound, consider a tree whose vertices are positive integers, identified 
with their binary expansions. The root is 1, and the children of x are 2x and 1x + 1. 
Associated with each node x are a set of n{x) possible transitions, where n(x) is the number 
of l's in the binary expansion of x. The transitions are to numbers gotten by flipping a 
1 and simultaneously flipping all bits to its right. Note that all transitions from x are to 
numbers less than x. Let r{x) denote the reward if the leading bit of x is flipped, namely 
one less than the number of leading l's in the binary expansion of x. Recursively, we assign 
to each x a mean reward and maturation time, (a(x),b(x)) as follows. Fix a set B of bad 
nodes, to be determined later. On first reading, take B to be empty. For any x, let y±, . . . , y n 
be the possible transitions from x. Let (a(x), b(x)) := (0, 0) if x € B and otherwise, let 



Let T be any finite binary rooted subtree, meaning that any vertex in the subtree has either 
zero or two children in the subtree. 

Lemma 8 Suppose no leaf of T is in B. Then an almost sure lower bound for the lim 
inf speed from any starting configuration is given by the minimum over leaves x of T of 
1 + a(x)/b(x). 

The lower bound in Theorem |2] will follow from Lemma |S] together with an implemen- 
tation of the recursion. Below is some code written in C that implements the recursion for 
a complete binary tree of depth 15, with the set B chosen to give a good bound without 
much trouble. 
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#include <stdio.h> 
#define N 18 
#define M 524288 

mainO { 

float a[M] [2] ; 

int i , j , k, n, min_index, locmin, power[N+2], bitcount=0, lastcount=0; 
float min_ratio; 

float locrat; 
float average; 

a[0] [0] = 0; a[0][l]=0; a[l] [0] = 0; a[l] [1] = 0; power [0] = 1; power[l]=2; average = 0; 
/* the n loop stratifies the recursion by levels of the tree */ 
for (n=l; n <= N; n++) 
{ 

power [n+1] = 2*power[n]; 

/* the i loop indexes vertices inside a level */ 

for (i=power[n]; i < power [n+1]; i++) 

{ 

bitcount = 0; lastcount=0; a[i] [0] =0; a[i] [1] =0; 

for (j=0; j <= n; j++) 

{ 

k = i" (power [j]) ; 

if (k < i) 

{ 

/* k < i iff there is a 1 in position k */ 

k = power[j + l]*(i / (power [j+1] ) ) + power[j+l] - 1 - i % power[j + l]; 
a[i] [0] += a[k] [0] ; 
a[i] [1] += a[k] [1] ; 
bitcount ++; 

} 

else lastcount=bitcount ; 

/* lastcount will eventually say how many l's after the leftmost zero */ 

} 

a[i] [1]++; 

a[i] [0] += bitcount - lastcount - 1; 
a[i] [0] = a[i] [0] / bitcount; 
a[i] [1] = a[i] [1] / bitcount; 

/* a particular choice of the set B is given in literal form */ 
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if ( 



i==168 
i==180 



i==672 



{ 

a[i] [0] 



i==2 

i==9 

i==18 

i==36 

i==40 

i==73 

i==80 

i==88 



i==4 I I 
i==10 I I 
i==19 I I 
i==37 I I 



i==5 
i==ll 
i==20 
i==38 I 



I I i==41 I I i==42 
| i== 7 4| | i==7 5 | | 

I i==81 I I i==82 I I 
I i==89 I I i==90 I 



I i==8 I I 
I 

I i==21 I I i==22 I I i==23 I I 

I i==43 I I i==44 I I i==45 I I i==46 I I i==47 

i==83 I I i==84 I I i==85 I I i==86 I I i==87 I I 
! i==91 I I 



i==160 I I 


a n a II 

i==161 I I 


i 


==162 I I 


i= 


= 163 I I i 


==164 I I i= 


A n v II 

=165 1 1 i== 


A f f II ' A f T II 

166 I I i==167 I I 


1 i==169 


II- a —rr\ 

\ \ i==170 


1 


A —7 A 

i==171 


1 


A -71-1 1 1 

i==172 I I 


i==173 I I 


i==174 I I 


■ A *~7 1 — II* A *~7 f\ 1 1 

i==175 1 1 i==178 I I 


1 i==181 


1 i==182 


1 

1 


i==183 


| 

1 










i==324 I I 


i==325 I I 


i 


==326 I I 


i= 


=328 I I i 


==329 I I 






i==330 I I 


i==331 1 1 


i 


==332 I I 


i= 


=333 I I i 


==334 I I i= 


=335 I I 




i==336 I I 


i==337 I I 


i 


==338 I I 


i= 


=339 I I i 


==340 I I i= 


=341 I I i== 


O /I i~l II • o /I o II 

342 I I i==343 1 1 


i==344 I I 


i==345 1 1 


i 


==346 I I 


i= 


=347 1 1 i 


==348 I I i= 


=349 I I 




i==350 I I 


i==351 I I 
















i==361 I I 


i==362 | | 


i 


==363 I I 












i==656 I I 


r> i- II 

i==657 I I 


i 


==658 I I 


i= 


=659 I I i 


==660 I I i= 


=661 I I i= 


=662 I I i==663 I I 


1 i==673 


1 i==674 


1 


i==675 


1 


i==676 I I 


✓""-7-7 1 1 

i==677 I I 


i==678 I I i 


==679 I I 


i==680 I I 


i==681 I I 


i 


==682 I I 


i= 


=683 I I 








i==684 I I 


i==685 I I 


i 


==686 1 1 


i= 


=687 I I i 


==688 1 1 i= 


=689 I I 




i==690 I I 


i==691 I I 


i 


==692 I I 


i= 


=693 1 1 i 


==694 I I i= 


=695 I I i== 


697 I I i==698 1 1 i== 


i==1322 I 


1 i==1323 


1 














i==1346 I 


1 i==1347 


1 


i==1348 


1 1 


i==1349 








i==1350 I 


1 i==1351 


1 


i==1352 


1 1 


i==1353 


1 I i==1354| 


1 i==1355 1 


1 


i==1356 I 


1 i==1357 


1 


i==1358 


1 1 


i==1359 








i==1360 I 


1 i==1361 


1 


i==1362 


1 1 


i==1363 


1 1 i==1364 


1 1 i==1365 




i==1366 I 


1 i==1367 


1 


i==1368 


1 1 


i==1369 


1 I i==1370 


1 I i==1371 




i==1372 I 


1 i==1373 


1 


i==1374 


1 1 


i==1375 


1 1 i==1380 


1 1 i==1381 




i==1384 I 


1 i==1385 


1 


i==1386 


1 1 


i==1387 


1 1 i==1388 


1 1 i==1389 


1 1 i==1390 I I i==1391 


i==2704 I 


1 i==2705 


1 


i==2706 


1 1 


i==2707 


1 I i==2708 


1 I i==2709 


1 I i==2710 1 1 i==2711 


i==2720 I 


1 i==2721 


1 


i==2722 


1 1 


i==2723 


1 I i==2724 


1 1 i==2725 


1 I i==2726 I I i==2727 


i==2728 I 


1 i==2729 


1 


i==2730 


1 1 


i==2731 


1 I i==2732 


1 1 i==2733 


1 I i==2734 I I i==2735 


i==2736 I 


1 i==2737 


1 


i==2738 


1 1 


i==2739 


1 I i==2740 


1 I i==2741 


1 I i==2742 I I i==2743 


i==2745 I 


1 i==2746 


1 


i==2747 


1 1 










i==5456 I 


1 i==5457 


1 


i==5458 


1 1 


i==5459 


1 I i==5467 






i==5460 I 


1 i==5461 


1 


i==5462 


1 1 


i==5463 


1 I i==5465 


1 I i==5466 


) 



0; a[i] [1] = 0; 
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} 

if (i == (M/2)) 
{ 

min_index = i; 

min_ratio = a[i] [0]/a[i] [1] ; 
} 



if (i > (M/2)) 
{ 

if ((a[i] [0]/a[i] [1] ) < min_ratio) 
{ 

min_index = i; 

min_ratio = a[i] [0]/a[i] [1] ; 

} 

} 

if (i > (M/2)) 

{ 

average = average + (a[i] [0] /a[i] [1] ) ; 
} 

if (i == 3) 
{ 

locmin = i; 

locrat = a[i] [0]/a[i] [1] ; 
} 

if (i < (M/2)) 
{ if (i > 71) 
{ 

if (a[i] [1] > 0) 

{if ((a[i] [0]/a[i] [1]) < locrat) 

{ 

locmin = i; 

locrat = (a[i] [0])/(a[i] [1]); 

> 

} 

} 

} 

} 

printf ("\n") ; 
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printfC'Row with minimum ratio is:\n"); 

printf ("min_index=7 i,exp. reward=7 f , exp. time=°/ f, ratio=°/ f \n" ,min_index, 

a[min_index] [0] , a[min_index] [1] , a[min_index] [0] /a[min_index] [1]); 
printfC'Row with local minimum ratio is:\n"); 

printf ("locmin=7,i , exp . reward=°/ f , exp. time=°/ n f, ratio=7f \n" , locmin, locrat, average, 
» 

A look at the data shows the minimum value of a(x)/b(x) on each level to be obtained 
when the binary expansion of x alternates. In particular, the global minimum is at x = 
349525 and has value 0.646 . . ., which proves the lower bound. 

Proof of Lemma |SJ Any finite rooted binary subtree induces a prefix rule, that is, a map 
r\ from infinite sequences beginning with a 1 to leaves of T, defined by r/(x) = w for the 
unique leaf of T that is a prefix of rj. 

Given a trajectory of the Markov chain {A^}, define a sequence of elements of T as 
follows. Let xq := rj(Xo) be the prefix of the initial state of the trajectory. Let to := 0. As 
the definition proceeds, verify inductively that for < t < r^+i, the string x k will be an 
initial segment of Xt- The recursion is as follows. Let t^+i be the first time after r& that 
a 1 flips in the initial segment x^ of Xt- Let x 1 ^,^ be the string gotten from x\~ by flipping 
this bit and all bits to its right. If x' k+l ^ B and x' k+1 is not the zero string then let x^+i 
be x' k+l , stripped of any leading zeros. If x' k+l € B or x' k+l is the zero string, then let 
x k+1 = f]{X Tk+1 ). 

Let po, pi, . . . be the successive times that this latter transition occurs, that is, successive 
times t in the recursion that r](X t ) is computed. Given x G T, suppose we begin counting 
every time the sequence {x k } hits x and stop counting at every time p^. When we are 
counting, we count how many times the leading 1 flips, and how many l's flip together 
when this happens, or more precisely, we keep a cumulative count, each time adding one 
less than the number of l's that have flipped together. Let this cumulative count be denoted 
(A(x, t), B(x, t)), where A counts flips of the leading 1. If x € B, then by convention we do 
not count anything. 

Claim: For each x € T\B, if x is visited infinitely often then the ratio of A(x, t) /B(x, t) 
converges as t — > oo to a(x)/b(x). Proof: Conditioned on the past, each time we start 
counting we are equally likely to transition to each of the n{x) possible transitions. The 
claim then follows from induction on x. 
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Each x defines a set of time where x is on, that is, where we are counting leading 1 
flips and rewards for that x. As x ranges over the leaves of T, the on-timesets for x 
partition [0, oo). In each such time set, we have shown that the mean number of recorded 
simultaneous 1-flips per leading 1 flip is 1 + a{x)/b(x) in the limit, as long as the time set 
is unbounded. Since the partition is finite, it follows that the lim inf speed is at least the 
minimum of the values. □ 

We remark that the only place this computation is not sharp is when the number of l's 
flipping together exceeds the number recorded, because the present knowledge of the prefix 
was a string of all l's and there were more l's after this that also flipped. Thus by making 
the tree T big enough, even without increasing B, we can get arbitrarily close to the true 
value. 



5 Further observations 



The following argument almost solves Problem (3a), and perhaps may be strengthened to 
a proof. Lemma 4 of GHZ98 is proved by means of a duality result. The result is that the 
probability, starting from a uniform random state, of finding a 1 in position r after t steps 
(counting suppressed transitions), is equal to half the probability that x r has not reached 
the minimum yet after t steps (again counting suppressed transitions). The argument that 
proves this may be generalized by introducing a simultaneous coupling of the process from 
all starting states. The probability, from a uniform starting state, of finding a 1 in every 
position in a set A after t steps, is then the expectation of the function that is zero if the 
column vector of all l's is not in the span of the columns of the matrix whose rows are the 
states reached at time t starting at x r , as r varies over A, and is 2~ u if the column vector 
of all l's is in the span and the matrix has rank u. The kernel of the matrix is the set 
of starting configurations that reach the minimum by time t (the simultaneous coupling is 
linear). Hence, as long as A and t are such that the probability of reaching the minimum 
from any x r by time t goes to zero, the rank of the matrix will be |^4| and the probability 
of finding all l's in positions in A at time t will go to 2~I" 4 L In particular, if a window of 
fixed size moves rightward faster than the limsup speed, then what one sees in this window 
approaches uniformity. This is not good enough to imply uniformity of a window a fixed 
distance to the right of the leftmost 1. 
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